Do hierarchical structures assemble best via hierarchical pathways? 
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Hierarchically structured natural materials possess functionalities unattainable to the same components organized or mixed in 
simpler ways. For instance, the bones and teeth of mammals are far stronger and more durable than the mineral phases from 
which they are derived because their constituents are organized hierarchically from the molecular scale to the macroscale. Mak- 
ing similarly functional synthetic hierarchical materials will require an understanding of how to promote the self-assembly of 
structure on multiple lengthscales, without falling foul of numerous possible kinetic traps. Here we use computer simulation to 
study the self-assembly of a simple hierarchical structure, a square crystal lattice whose repeat unit is a tetramer. Although the 
target material is organized hierarchically, it self-assembles most reliably when its dynamic assembly pathway consists of the 
sequential addition of monomers to a single structure. Hierarchical dynamic pathways via dimer and tetramer intermediates are 
also viable modes of assembly, but result in general in lower yield: these intermediates have a stronger tendency than monomers 
to associate in ways not compatible with the target structure. In addition, assembly via tetramers results in a kinetic trap whereby 
material is sequestered in trimers that cannot combine to form the target crystal. We use analytic theory to relate dynamical 
pathways to the presence of equilibrium phases close in free energy to the target structure, and to identify the thermodynamic 
principles underpinning optimum self-assembly in this model: 1) make the free energy gap between the target phase and the 
most stable fluid phase of order k^T, and 2) ensure that no other dense phases (liquids or close-packed solids of monomers or 
oligomers) or fluids of incomplete building blocks fall within this gap. 
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1 Introduction 

Since assembly pathway strongly determines the ability of 
structures to self-assemble efficiently, it is vital to understand 
what rules link building block design, assembly pathway, and 
assembly yield. While some structures assemble via two-step 
mechanisms involving intermediate structures distinct from 
both the parent and target structures 1 , an important dichotomy 
separates even those pathways whose intermediates remain 
commensurate with the target structure: assembly may oc- 
cur either directly via sequential accumulation of monomers, 
as in classical theories of nucleation and growth^, or hier- 
archically via formation of intermediate- scale structures that 
act as the building blocks for subsequent stages of assembly. 
In general, hierarchical pathways dominate whenever clusters 
meet via diffusion faster than they exchange monomers^, so 
in principle any system's assembly pathway can be adjusted 
by tuning binding energies^. For instance, the pathway for 
phase separation of binary fluids changes with increasing su- 
percooling from direct assembly to hierarchical coalescence 
of clusters 67 . While binary fluids demix more efficiently via 
their direct pathway, it seems reasonable to expect that ma- 
terials with hierarchical structure might assemble more effi- 
ciently via hierarchical pathways. Nature abounds with mate- 
rials such as collagen 8 , ename FH vir us capsids 10 , cytoskele- 
tal structures^, and biominerals 12 l 13 i whose functionality de- 
pends on their having structure on multiple length scales^. 
Our pursuit of similarly functional hierarchical materialsEH23 
would be aided by knowing which dynamic modes of assem- 
bly to promote, and which to suppress. 



Here we ask the question: do hierarchical structures assem- 
ble best via hierarchical pathways? We can begin to answer 
this question through the intensive study of computer models 
inspired by real systems. A simple example of a natural hierar- 
chical material is provided by the SbpA surface-layer (S-layer) 
protein, which forms a membrane on the outsides of the bac- 
terium Lysinibacillus sphaericus and on surfaces in vitr d 2l ™ 24 \ 
This membrane, which is robust and controllably porous, is a 
square crystal lattice whose repeat unit is a tetramer, and so is 
a member of perhaps the simplest class of hierarchical materi- 
als, a one-component structure possessing order on two length 
scales (that of the monomer and that of the tetramer). Here 
we use computer simulation to study the self-assembly of a 
model of this system, broadly varying model parameters to 
alternately favor hierarchical or direct assembly pathways, in 
order to determine which dynamical pathways result in most 
reliable assembly of the target material. We also use analytic 
mean-field theory to relate these dynamical pathways to the 
model's underlying thermodynamics, thereby establishing a 
set of simple principles that describe where in phase space 
optimal self-assembly occurs. 

The constituent monomers of our model associate in three 
distinct ways (Fig. [TJ: via chemically and directionally spe- 
cific interactions that promote the formation of tetramers; via 
specific interactions that bind tetramers together (and which 
on their own promote the formation of dimers); and via non- 
specific interactions that promote the formation of extended 
liquid and close-packed solid phases. Varying the strengths of 
the two specific interactions leads to three types of pathways 
for the assembly of the target structure: when the tetramer- 
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Fig. 1 Model geometry. Two chemically specific attractive 
interactions, a tetramer- stabilizing interaction e tet between patches 
of type 'E' and 'S\ and a dimer- stabilizing interaction e^im between 
patches of type 'L,' make possible the self-assembly of monomers 
(lower left) into an open square lattice of tetramer s (upper right). 
Specific interaction strengths may be tuned to favor different 
self-assembly pathways: non-hierarchical assembly involving the 
association of monomers (diagonal), assembly via the formation of 
tetramer s (lower right), and assembly via the formation of dimer s 
(upper left). In addition, a nonspecific attractive interaction £ n 
(range depicted by dashed rectangle, lower right) allows the 
unstructured aggregation of monomers, dimers, and tetramers, and 
makes possible extended liquid and solid close-packed phases. 



forming interaction is strong, assembly occurs hierarchically 
via the formation of the tetramer repeat unit; when the dimer- 
forming interaction is strong, assembly occurs hierarchically 
via the formation of dimers; and when the two interaction 
strengths are balanced (in a roughly 1:2 ratio), assembly oc- 
curs non-hierarchically via the association of monomers. 

In a previous paper 25 we focused on the non-hierarchical 
regime and determined how to optimize assembly by adjust- 
ing the balance of specific and nonspecific interactions. We 
found that assembly is best when 1) the free energy gap be- 
tween the target phase and the most stable fluid phase is of 
order k^T\ and 2) monomers associate in a weak nonspecific 
fashion, without promoting liquid- vapor phase separation. Al- 
though the target structure is an extended crystalline one, its 
self-assembly is in important ways more like the self-assembly 



of model virus capsids^"^- whose components require a 
balance of interaction strength and specificity in order to as- 
sociate reliably - than the crystallization of spherical colloids, 
because the latter form of assembly is facilitated by liquid- 
liquid criticality 30 or demixing 31 . 

In this paper we vary all three interaction energies in order 
to determine what additional design principles apply when hi- 
erarchical assembly pathways are possible. The two design 
principles established in our previous work continue to hold. 
We also find that despite the hierarchical structure of the tar- 
get material, optimal assembly occurs non-hierarchically, via 
sequential addition of individual building blocks to a larger 
structure. Assembly via dimer or tetramer intermediates leads 
in general to assembly of reduced quality, even though these 
intermediates are commensurate with the target structure. This 
reduction in quality stems from two types of kinetic trap- 
ping. The first involves the nonspecific aggregation of dimers 
or tetramers, which, because of the larger energy scales in- 
volved, occurs more readily than the nonspecific aggrega- 
tion of monomers. The second, which occurs along tetramer- 
dominated pathways, involves the rapid formation of trimers 
(and the simultaneous depletion of the monomer pool), which 
cannot be combined to produce the target structure. This 'in- 
complete building block' kinetic trap is a lso se en in com- 
puter simulations of virus capsid assembly^EHES We use 
analytic theory to identify the thermodynamics underpinning 
these traps, enabling us to refine design rule 2) for this model 
system: assembly is poor if liquid, close-packed solid, or 
incomplete-building-block phases lie lower in free energy than 
the parent fluid and within about &b T of the stable square lat- 
tice. 

2 Model and methods 

We used the model studied in RefsJSED illustrated in 
Fig. [T] Hard rectangular monomers of width a and length 
la (I = 11/5) live on a smooth, two-dimensional substrate. 
Monomers are designed to form a square lattice whose repeat 
unit is a tetramer. The square lattice is stabilized by two types 
of chemically specific interactions, each mediated by a square- 
well attraction of range a/ 5. One is a tetramer- stabilizing at- 
traction of strength — e te t^B^, which binds together patches 
labeled 'E' and 'S' on neighboring monomers. The other is 
a dimer- stabilizing attraction of strength — Sdim^B^, which 
binds together two patches labeled 'L'. (These interactions 
were called £j nt and £ ext in Refs. 25 36 .) The dimer- stabilizing 
attraction binds neighboring tetramers into a square lattice. 

In addition to their specific interactions, monomers possess 
a nonspecific pairwise attraction of energy — £ n k^T if circum- 
scribed rectangles of width a + 2A and length la + 2A, where 
A = a/5 (dashed rectangle in Fig. [T]) overlap. The nonspe- 
cific interaction allows formation of the dense amorphous pro- 
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Fig. 2 Good assembly is localized in phase space. Phase diagram and dynamic yield after 10 7 MC cycles as a function of specific interaction 
strengths e tet and £^ m for a concentration = 0.1 and one choice of nonspecific interaction strength, £ n = 2. Solid grey curves denote 
mean-field phase boundaries for the labeled simulated coexistence combinations. The dashed black (white) curves denote lines of constant B\ 
(driving force Peak yield is localized and does not fall within a slot of constant B\. A line of constant driving force & w k^T is a better 
predictor of (and a necessary condition for) good assembly. For this set of parameters, yield is best in the non-hierarchical sector (bottom left 
of the assembly region). The symbols connect with their counterparts in Figs. [3| and |4| 
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tein clusters seen in AFM images of SbpA self- assembly^, 
and allows formation of macroscopic liquid phases implicated 
in protein self-assembly generally ^ 30 ! 38 ! 39 !. In general, the 
nonspecific interaction mediates transitions from gas to liquid 
to solid, while specific interactions determine whether those 
phases consist of monomers, dimers, or tetramers. 

We solved the model's thermodynamics using the ana- 
lytic mean-field theory described in Ref. 25 , whose predic- 
tions closely match the results of direct equilibrium simula- 
tions. We used this theory to calculate the free energies of 
8 possible bulk phases: fluids of monomers, dimers, trimers 
and tetramers; close-packed monomer, dimer and tetramer 
structures; and the target square lattice structure. We de- 
fine the thermodynamic driving force for assembly, as 
the free energy gap between the stable square lattice struc- 
ture and the most stable fluid phase. In addition, we char- 
acterized the effective strength of pairwise interactions using 
the reduced second virial coefficient B\ = B 2 / 'i?!j ardcore ^ w h ere 
B 2 = (47T) -1 JdrijdGij (1 - e~P Ui J). This integral runs over 
all angles and distances for which two monomers (called i and 
j) interact; Uij is the monomer-monomer interaction energy, 
^hardcore is defined similarly, but for monomers whose attrac- 
tive interactions have been switched off. 

We determined self-assembly dynamics using virtual-move 
Monte Carlo simulations^^ (using the version of the algo- 
rithm described in Ref. 41 ) of 1024 monomers at constant 
packing fraction, starting from well-mixed conditions. We 
employed the parameterization established in Ref .El to model 
overdamped motion in solution, ensuring that bound clusters 
of hydrodynamic radius R diffuse according to the Stokes so- 
lution for spheres of radius R. Taking a = 3.9 nm, T = 300 
K, and solution viscosity 7] = 1.00 x 10 -3 Pa s, each Monte 
Carlo (MC) cycle corresponds to 2.42 ns. As in RefsJSES 
we quantified square lattice assembly by recording the scaled 
yield / = /3C/3/C/3 + fi)) 2 , where f 3 and f 2 are the frac- 
tion of monomers with two and three specific bonds satis- 
fied, respectively. The squared factor rewards crystal domains 
with large bulk-to-boundary ratios and penalizes systems with 
many small or stringy domains. However, since it also penal- 
izes systems with many free tetramers, we checked that our 
conclusions continue to hold when replacing / with the un- 
sealed yield fe. The supplemental figures display alternate 
versions of figures using this replacement. 

3 Results 

Assembly pathways and modes of kinetic trapping vary 
widely throughout parameter space. Fig. [2] shows a color 
plot denoting the yield of the target square lattice structure ob- 
tained after long (10 7 MC cycles or 24.2 ms) dynamical simu- 
lations, overlaid on the equilibrium phase diagram. Grey lines 



on the phase diagram (derived from our analytic theory) show 
regions of stability of individual or coexisting phases, whose 
nature is illustrated by the associated simulation snapshots. 
The diagram is obtained by varying the specific interactions 
£dim and £ te t for one choice of nonspecific interaction strength, 
£ n = 2, and packing fraction, (f) =0.1. Yield is appreciable 
only in the upper right quadrant of the diagram, where both 
types of specific interactions are strong enough for the square 
lattice to be stable. (While the other quadrants are not the 
focus of this paper, the phase behaviors found there are rich, 
comprising homogeneous fluid, liquid, and close-packed crys- 
tal phases of of monomers, dimers, and tetramers.) Within the 
region of square lattice stability, assembly is generally good 
along the contour & = 2&b T (corresponding to one of the de- 
sign rules determined in our previous wortf^), and is best at 
the lower left corner of the contour. 

This localization can be understood by considering the dy- 
namics of assembly pathways, which vary widely in charac- 
ter throughout this diagram. This variation is illustrated by 
snapshots in Fig. [3] and time courses of several important mi- 
croscopic environments in Fig. [4] The symbols - green star, 
orange triangle etc. - are common to Figs.[2j|4] 

When £dim and £ t et are balanced in a roughly 2:1 ratio 
(e.g. at the green star on Fig. [2]), assembly occurs non- 
hierarchically, via the association of monomers. Yield is 
high in this regime where the thermodynamic driving force 
for assembly is between about 1 and 2 k^T (note the green 
spots denoting high yield on Fig. [2]). Yield declines slightly 
in the nucleation regime near the phase boundary (e.g. at 
the blue circle) because of the lesser thermodynamic driv- 
ing force for assembly, even though such assembly tends to 
result in only a single crystalline cluster. Similar observa- 
tions have been made in other simulation models^Iill Ki- 
netic trapping occurs when the driving force for assembly is 
large (e.g. at the red diamond). Here, monomer-monomer 
contacts are too large in energy for mistakes in binding to 
be corrected before structures grow appreciably. These mis- 
takes result in the accumulation of 'misbound' monomers, 
those making a dimer- stabilizing bond but only one of two 
tetramer-stabilizing bonds, and the consequent assembly of 
gel-like structures with no counterpart on the thermodynamic 
phase diagram. The accumulation of 'misbound' monomers 
is depicted in the second panel of Fig.|3](c) and quantified by 
the red diamond trace in Fig.|4jf). 

When £dim is large (e.g. at the orange triangle), monomers 
rapidly form dimers which subsequently associate to form the 
target crystal structure. Even for an optimal thermodynamic 
driving force of 1 to 2 &b T, yield in this regime is slightly 
less than for the optimal non-hierarchical pathway, because 
dimers have a stronger tendency than monomers to associate 
nonspecifically: there are more ways two dimers can asso- 
ciate in a manner not commensurate with the target structure. 
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Fig. 3 Snapshots illustrate the diversity of self-assembly pathways seen throughout Fig. |2j Time-ordered snapshots along assembly 
trajectories for (a) best assembly conditions and (b-f) four other points from Fig. [2] From left to right, the panels show the system after 10 5 
MC cycles (242 ^s); important type(s) of microscopic environments seen in each trajectory; the system after 10 6 MC cycles (2.42 ms); and 
the system after 10 7 MC cycles (24.2 ms). Color code: crystalline monomers, with all three specific bonds satisfied, are green; monomers with 
both tetramer- stabilizing bonds satisfied are tan; monomers with its dimer- stabilizing bond satisfied are grey; monomers with only one 
tetramer- stabilizing bond satisfied are pale blue; and monomers with no specific bonds satisfied are dark blue. Symbols at left connect with 
their counterparts in Figs.[2|and|4| 
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Fig. 4 77m£ evolution of microscopic environments reveals the 
variety of assembly pathways and modes of kinetic trapping seen 
throughout Fig. [2] Each panel depicts the time evolution of a 
particular microscopic environment for the five points in phase 
space marked in Fig. [2] and illustrated in Fig. [3] (a) crystalline 
monomers having all three specific bonds satisfied; (b-d) aggregated 
monomers, dimers, and tetramers, respectively, defined as 
monomers with no specific bonds, one external bond, and both 
internal bonds satisfied, plus at least two additional nonspecific 
bonds; (e) trimers, defined as monomers belonging to a 
three-member cluster bound by specific interactions; and (f) 
'misbound' monomers, defined as monomers having a 
dimer-forming but only one tetramer-forming bond satisfied. 
Symbols at left connect with their counterparts in Figs.|2|and[3| 



Further, the larger energy scales involved - mediated by 4 par- 
ticles instead of 2 - mean that binding mistakes are less read- 
ily corrected, hampering crystal growth and preventing effi- 
cient coarsening of nuclei. In particular, we find that dimers 
tend to aggregate nonspecifically with each other and coat the 
boundaries of growing crystal nuclei without reorienting effi- 
ciently to form specific bonds, reminiscent of the idea of sur- 
face "blocking" by nonspecifically-bound proteins discussed 
in Ref Such aggregation is depicted in the second panel 
of Fig. [3] (d) and quantified by the orange diamond trace in 
Fig.gjc). 

Similar microscopies plagues the tetramer-dominated path- 
ways, when £ te t is large (e.g. at the black square on Fig. [2]): 
tetramers also aggregate more readily than monomers. Note 
the peak associated with the black square trace in Fig.|4jd). In 



addition, the formation of trimers, which are not commensu- 
rate with the target structure, and the simultaneous depletion 
of the monomer pool, leads along tetramer-dominated path- 
ways to an 'incomplete building block' kinetic trap f amiliar 
from the study of virus capsids on the computer 27 l 32 n 35 [ At 
the position marked by the black square on Fig. [2] the trimer 
acts more as a natural kinetic intermediate between dimers 
and tetramers than as a true 'trap' (note the peak in the black 
square trace of Fig.[4jc), which diminishes at late times), but 
the associated trap becomes increasingly prominent as the 
energy scale associated with the tetramer-forming interaction 
increases. The second panel of Fig. [3] (e) depicts an aggregat- 
ing tetramer and a trimer at early stages of assembly. 

Weak nonspecific interaction is generally beneficial, but di- 
minishes the relative effectiveness of hierarchical pathways. 

In Refs. 12331 we found that weak nonspecific interactions facil- 
itate assembly by allowing the strongest interactions (the spe- 
cific ones) to be made weaker while maintaining an optimal 
thermodynamic driving force, thus enabling more efficient an- 
nealing of defects. In Fig. [5ja) we show that a similar trend 
is seen in the space of the two specific interactions: as non- 
specific interaction £ n increases from to k^T, best yield in- 
creases markedly. However, if £ n is further increased, to 2k% T 
(the value used in Fig. [2]), yield generally decreases, consistent 
with our previous finding that density fluctuations associated 
with nonspecific association tend to conflict with the 'symme- 
try fluctuations' required to form the square lattice^. Further, 
the nonspecific interaction acts to favor non-hierarchical path- 
ways over hierarchical ones. For £ n = peak yield (which 
is only moderate) does not differ appreciable from the non- 
hierarchical pathway sector (lower left of the assembly region) 
to the hierarchical pathway sectors (top left and bottom right 
portions of the assembly region). For £ n = 1, by contrast, the 
monomer and dimer pathways are noticably better than the 
tetrameter one. For still larger nonspecific interaction, £ n = 2, 
best yield is found in the region of non-hierarchical assembly. 
Supplemental Fig. 1 shows that the same trends result when 
using the unsealed definition of yield, though the tetramer 
pathway is not noticeably worse than the dimer and monomer 
pathways at £ n = 1 . The relative efficiency of the direct path- 
way at large nonspecific interactions suggests that dimers and 
tetramers are more susceptible to nonspecific aggregation than 
are monomers, and that this aggregation proves deleterious to 
yield. 

This suggestion is confirmed by the 'pathway dia- 
grams 'E3EI shown in Fig. |5jb). These denote the largest 
fraction of specific microscopic environments seen along 
trajectories at each point in phase space. These diagrams 
confirm that nonspecific aggregation of oligomers (dimers and 
tetramers) plagues much of phase space when the nonspecific 
interaction is large, while misbinding (the formation of only 
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Fig. 5 Weak nonspecific interaction is generally beneficial, but diminishes the effectiveness of hierarchical pathways relative to the direct one. 
(a) Phase diagram and dynamic yield after 10 7 MC cycles as a function of specific interaction strengths e tet and e^im f° r three nonspecific 
interaction strengths and a concentration (j) = 0.1. (b) Pathway diagrams (see text) illustrate that the regions of good assembly are bounded by 
regions exhibiting three types of kinetic trapping: misbinding, aggregation of oligomers (dimers or tetramers), and the formation of trimers. 
Weak nonspecific interaction (middle row) facilitates assembly, while stronger nonspecific interaction (bottom row) begins to suppress it. 
Furthermore, nonspecific modes of binding disfavor hierarchical pathways: moving from top to bottom, strengthening nonspecific interactions 
first disfavors assembly via tetramers (compare first and second rows), then disfavors assembly via dimers (compare second and third rows). 
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Fig. 6 Comparison of dynamics and thermodynamics reveals principles of good assembly. Dynamic quantities (second row) respond to 
variations in the free energies of bulk phases in free energies (third and fourth rows), along paths, labeled s, of constant thermodynamic 
driving force (top row) through the phase diagrams of Fig. [5] Shown are free energies of pure fluid and crystal phases; the free energies of 
equilibrium coexisting phases are equal or slightly lower. Hierarchical pathways owe their existence to dimer and tetrameter phases close to or 
lower than the parent phase in free energy; when these low-lying oligomer phases are close-packed (see right column, bottom panel), dynamic 
aggregation of oligomers leads to a reduction in yield (see right column, second panel). Non-hierarchical pathways resulting in high yield tend 
to occur when nothing lies between the monomer fluid and the target crystal in free energy. 
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Fig. 7 Building blocks designed to assemble directly perform better than building blocks designed to assembly hierarchically, even under 
optimal conditions, (a) 'Pathway design diagram' showing yield after 10 7 MC cycles at = 0.1, for a wide range of building block designs 
defined by different interaction strength ratios Ctet/^n and £dim/ £ n- The absolute values of these parameters are chosen, at each point on the 
diagram, in order to optimize assembly; building blocks identified in this way are therefore the best-designed representatives of certain kinds 
of pathway. The symbol °o denotes unbreakable specific interactions, and the row (resp. column) marked 'D' (resp. 'T') correspond to 
simulations initialized with dimers (resp. tetramers) bound by unbreakable interactions. Comparison of yield and (b) pathways demonstrates 
that while oligomer aggregation has been largely eliminated, the trimer kinetic trap is always present when strong tetramer-forming bonds 
exist. Symbols show the points of intersection of this diagram with those shown in Fig.[8| 



two out of three specific bonds) is the prevalent kinetic trap in 
the upper right sector of all diagrams, where one or both of 
the specific interactions are large. These two types of kinetic 
trapping bound the regions of optimal assembly, while the 
trimer 'incomplete building block' trap poses an additional 
impediment to assembly in part of the tetramer-dominated 
phase space. 

Comparison of dynamics and thermodynamics reveals prin- 
ciples of good assembly. Analysis of our analytic mean-field 
theory reveals the thermodynamic underpinning of the differ- 
ent dynamic pathways and modes of kinetic trapping seen in 
Fig. [5] In Fig. [6] we show the three yield diagrams from Fig. [5] 
above a set of data collected along a contour of thermody- 
namic driving force J^, chosen to intersect the region of best 
yield in each case. Distance along the contour is labeled by 
the parameter s (see top row of Fig. [6]). On the second row 
of Fig. [6] we show self-assembly yield and the largest fraction 
of specific microscopic environments seen in dynamic simu- 
lations perfomed at points along each contour. Below simula- 
tion results we show bulk free energies of 8 different phases 
from our analytic theory. As well as the monomer fluid phase 
(the parent phase) and the target square lattice structure, we 
show free energies of fluids of dimers, trimers and tetramers 
(third row of figure), and free energies of close-packed phases 
of dimers and tetramers (fourth row of figure). The arrow be- 
low each column corresponds to the contour drawn on each 



phase diagram at the top. Supplemental Fig. 2 shows a similar 
version of Fig. fusing the unsealed definition of yield. 

Comparison of the dynamic and thermodynamic data 
yields two conclusions. First, hierarchical pathways owe their 
existence to phases (fluid and/or close-packed) of dimers and 
tetramers that lie between the parent phase and the target 
structure in free energy (or sit slightly above but close to 
the parent phase). Similar behavior is seen in the model of 
RefH This finding recalls Ostwald's rule of stagesEHSS the 
heuristic that asserts that phases closest to the parent phase 
in free energy are visited before the stable phase. Second, 
good assembly occurs where neither close-packed oligomer 
nor trimer phases lie near or below the parent phase in free 
energy. The connection between the metastability of these 
phases and kinetic trapping is made clear by noting that 
microscopic environments in the second row have appreciable 
values where the associated phases (same colors) in the third 
and fourth rows lie near or below the monomer fluid phase. 
This observation allows us to refine the second design rule 
governing best assembly in this model: do not allow dense 
liquid phases, extended close-packed solid phases, or fluids 
of incomplete building blocks to lie close in free energy to the 
target crystal structure. 

The best-designed building blocks assemble directly rather 
than hierarchically. Thus far we have focused on slices 
through parameter space in which the nonspecific interaction 
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Fig. 8 Apparently similar temperature-versus-packing fraction phase diagrams can conceal markedly different dynamics, (a) 
Temperature-packing fraction phase diagrams for three building blocks representative of non-hierarchical, dimer-dominated, and 
tetramer-dominated assembly, from top to bottom. Symbols show the points of intersection of these diagram with that shown in Fig. [7] 
equivalent to the temperature of peak yield for packing fraction =0.1. The associated (b) pathway diagrams show that thermodynamic 
phase diagrams that look similar in this representation can harbor markedly different dynamics. In particular, tetramerin assembles via a 
trimer-rich intermediate, which acts as the dominant kinetic trap at high temperature and low packing fraction. 
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is held fixed and the specific interactions are varied. In order to 
more thoroughly test whether hierarchical assembly pathways 
can outperform direct assembly pathways, we must design 
building blocks to avoid stabilizing unwanted phases while 
maintaining hierarchical assembly pathways. Since the en- 
ergetic part of the relative stability of target and close-packed 
phases are controlled by the ratios £tet/ £ n an d £dim/£n, tuning 
these ratios can alternately select hierarchical or direct path- 
ways while ensuring that dense phases do not compete with 
the target square lattice. Fig. [7] shows yield and pathway dia- 
grams as functions of ratios £dim/£n and £tet/£n, where pack- 
ing fraction is held fixed at 10% but the magnitude of the 
energies at each point is optimized for yield. The rightmost 
and topmost portions of these diagrams represent unbreak- 
able tetramer-forming and dimer-forming interactions, respec- 
tively, and denote simulations initialized from well-mixed ini- 
tial configurations (labeled '«>') or from gases of tetramers (la- 
beled T) or dimers (labeled 'D'). Fig. [7] (a) shows that un- 
der optimal conditions, the non-hierarchical designs (near the 
diagonal £di m /^tet = 2) assemble best, dimer- stabilizing de- 
signs (above the diagonal) assemble nearly as well (but not 
better), and tetramer- stabilizing designs (to the right of the 
diagonal) assemble much less well. Fig. [7] (b) confirms that 
this hierarchy is a result of kinetic trapping: while the aggre- 
gated oligomer trap has been all but eliminated, the trimer 'in- 
complete building block' kinetic trap cannot be eliminated in 
the presence of strong tetramer-forming interactions. When 
this trap is circumvented artificially, by seeding simulations 
with a gas of tetramers (columns labeled T), yield in the 
tetramer-dominated sector is much improved. Supplemental 
Fig. 3 shows that a similar conclusions result when using the 
unsealed definition of yield, though only strongly tetramer- 
stabilizing designs (£t e t/^n = 00 but not £t e t/^n < 10) suppress 
yield. 

Finally, we show in Fig. [5] (and Supplemental Fig. 4) 
the behavior of three building block designs representative 
of non-hierarchical assembly, dimer-dominated assembly, and 
tetramer-dominated assembly, respectively, in a more con- 
ventional temperature- versus-packing fraction representation. 
Symbols show the points of intersection between these dia- 
grams and the one shown in Fig. [7] Comparison of phase di- 
agrams, overlaid yield plots (panel a) and pathway diagrams 
(panel b) illustrates that three building blocks with similar- 
looking solubility curves and identical stable phases can dis- 
play a wide variety of dynamic pathways to self-assembly, and 
exhibit markedly different kinds of kinetic trapping. 

4 Conclusion 

The self-assembly of the hierarchical structure studied here 
can occur via many dynamical pathways. These pathways can 
be non-hierarchical or hierarchical, can involve intermediates 



with counterparts on the equilibrium phase diagram - such as 
dimers, tetramers, and dense fluid phases^ - or intermediates 
with no such counterpart, such as gel-like structures. Hierar- 
chical assembly in our model is no better or worse than non- 
hierarchical assembly in the absence of nonspecific interac- 
tions between building blocks, but rapidly becomes worse than 
non-hierarchical assembly when the possibility of nonspecific 
association exists. In our model, hierarchical assembly is also 
vulnerable to the formation of incomplete higher-order build- 
ing blocks (trimers) not co mmen surate with the target struc- 
ture. Models of virus capsid 27 ! 32 " 35 ! and protein complex^ as- 
sembly suggest that the 'incomplete building block' may be 
a general nuisance to the assembly of complicated structures, 
but that nature often mitigates this nuisance by favoring multi- 
step hierarchical assembly, such as the formation of capsomer 
hexamers as trimers of dimersPES. 

In general, best assembly is achieved in our model when 
the dynamical pathway is simplest, comprising sequential ad- 
dition of monomers to a single structure. The correspond- 
ing rules for design of the model's self-assembly pathways, 
gleaned from this work and our previous onesE323 appear 
to stem from two simple thermodynamic prescriptions: 1) 
make the free energy gap between the target phase and the 
most stable fluid phase of order k^T \ and 2) ensure that no 
other dense phase (liquid of close-packed solid of monomers 
or oligomers) or incomplete-building-block phase falls within 
this gap. Achieving this prescription requires tuning the three 
microscopic interactions possessed by the model, and, in par- 
ticular, making the nonspecific interaction weak. The connec- 
tion between underlying thermodynamics and self-assembly 
dynamics we have made helps us place the model in a broader 
context: its assembly is similar in certain regards to that of 
virus capsids, but mostly dissimilar to the assembly of close- 
packed crystals. Continued classification of the assembly 
pathways of numerous types of structur e ! 53 " 57 ! should eventu- 
ally reveal whether apparently dissimilar systems can nonethe- 
less be placed in similar categories of behaviors. 
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